An Efficient Estimator of the Mutation Parameter and Analysis of Polymorphism from the 1000 Genomes Project
Abstract
The mutation parameter θ is fundamental and ubiquitous in the analysis of population samples of DNA sequences. This paper presents a new, highly efficient estimator of θ that uses the phylogenetic information among distinct alleles in a sample of DNA sequences. The new estimator, called Allelic BLUE, is derived from a generalized linear model of the mutations in the allelic genealogy. It is not only highly accurate but also computationally efficient, which makes it particularly useful for estimating θ from large samples and across a large number of cases, as arises when analyzing sequence data from a large genome project such as the 1000 Genomes Project. Simulations show that Allelic BLUE is nearly unbiased, with variance close to the minimum achievable variance, and that in many situations it can be hundreds or thousands of times more efficient than a previous method, which was itself quite efficient compared with other approaches. One useful feature of the new estimator is its applicability to collections of distinct alleles without detailed frequencies. The utility of the new estimator is demonstrated by analyzing the pattern of θ in data from the 1000 Genomes Project.
Similar resources
Analysis of Record Data from the Scaled Logistic Distribution
In this paper, we consider the estimation of the unknown parameter of the scaled logistic distribution on the basis of record values. The maximum likelihood method does not provide an explicit estimator for the scale parameter. In this article, we present a simple method of deriving an explicit estimator by approximating the likelihood function. Bayes estimator is obtained using importance samp...
Bayesian estimation and prediction with multiply type-II censored sample of sequential order statistics from one-and-two-parameter exponential distribution
This article introduces sequential order statistics. Based on a multiply Type-II censored sample of sequential order statistics, Bayesian estimators are derived for the parameters of one- and two-parameter exponential distributions under the assumption that the prior distribution is given by an inverse gamma distribution and the Bayes estimator with respect to squared error loss ...
Study on DGAT1-exon8 Polymorphism in Iranian Buffalo
Objective: Diacylglycerol acyltransferase 1 is a microsomal enzyme that catalyzes the final step of triglyceride synthesis. The objective of this project is to examine the polymorphism at the exon 8 region of the DGAT1 gene using the PCR-SSCP technique in Iranian water buffaloes (Bubalus bubalis). Recent activities have shown that a significant association between lysine at amino acid positi...
Admissible and Minimax Estimator of the Parameter $\theta$ in a Binomial $\mathrm{Bin}(n, \theta)$ distribution under Squared Log Error Loss Function in a Lower Bounded Parameter Space
Extended Abstract. The study of truncated parameter spaces is of general interest for the following reasons: 1. They often occur in practice. In many cases certain parameter values can be excluded from the parameter space. Nearly all problems in practice have a truncated parameter space, and it is almost impossible to argue in practice that a parameter is not bounded. In truncated parameter...